Temporal difference learning

Results: 95



#Item
71Computational neuroscience / Cognitive science / Educational psychology / Motivation / Human behavior / Dopamine / Temporal difference learning / Learning / Reinforcement learning / Behavior / Mind / Ethology

PII: S0376[removed]X

Add to Reading List

Source URL: www.ncbi.nlm.nih.gov

Language: English
72Computational neuroscience / Cybernetics / Reinforcement learning / Q-learning / Temporal difference learning / SARSA / Markov decision process / Unsupervised learning / Recurrent neural network / Machine learning / Neural networks / Statistics

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2013-12-19 20:23:45
73Neuroscience / Neural networks / Artificial intelligence / Cognitive science / Learning / Q-learning / Artificial neural network / Reinforcement learning / Temporal difference learning / Computational neuroscience / Machine learning / Science

A neural model of hierarchical reinforcement learning Daniel Rasmussen ([removed]) Chris Eliasmith ([removed]) Centre for Theoretical Neuroscience, University of Waterloo Waterloo, ON, Canada, N

Add to Reading List

Source URL: mindmodeling.org

Language: English - Date: 2014-08-06 20:59:14
74Cybernetics / Reinforcement learning / Temporal difference learning / Computational statistics / Reinforcement / Q-learning / Artificial neural network / Machine learning / Neural networks / Computational neuroscience

A neural reinforcement learning model for tasks with unknown time delays Daniel Rasmussen, Chris Eliasmith {drasmuss,celiasmith}@uwaterloo.ca Centre for Theoretical Neuroscience, University of Waterloo Add to Reading List

Source URL: compneuro.uwaterloo.ca

Language: English - Date: 2013-09-18 09:52:49
75Temporal difference learning / Reinforcement / Nao / Artificial intelligence / Behavior / Philosophy of psychology / Behaviorism / Reinforcement learning / Robot

explore.stocks32.1.7.sum.eps

Add to Reading List

Source URL: apps.cs.utexas.edu

Language: English - Date: 2012-12-13 11:51:50
76Ethology / Reinforcement learning / Artificial intelligence / Foraging / Reinforcement / Neural network / Temporal difference learning / Computational neuroscience / Science / Knowledge

Reinforcement Learning I: Temporal Differences Hal Daumé III Computer Science University of Maryland [removed]

Add to Reading List

Source URL: www.umiacs.umd.edu

Language: English - Date: 2012-02-23 13:17:35
77Mathematical analysis / Representation theory of Lie algebras / Weight / Normal distribution / Estimation theory / Poisson processes / Maximum spacing estimation / Symbol / Abstract algebra / Representation theory of Lie groups / Statistics

TDγ : Re-evaluating Complex Backups in Temporal Difference Learning George Konidaris∗† MIT CSAIL† Cambridge MA 02139

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-06-08 19:44:28
78Preconditioner / Statistics / Reinforcement learning / Iterative method / Markov chain / Matrix / Sparse matrix / Applied mathematics / Markov models / Numerical linear algebra / Mathematics

Preconditioned Temporal Difference Learning Hengshuai Yao Zhi-Qiang Liu School of Creative Media, City University of Hong Kong, Hong Kong, China

Add to Reading List

Source URL: www.machinelearning.org

Language: English - Date: 2008-05-23 03:34:40
79Statistics / Mathematical analysis / Regularization / Tikhonov regularization / Learning / Supervised learning / Mathematical optimization / Least squares / Reinforcement learning / Machine learning / Mathematics / Linear algebra

A Dantzig Selector Approach to Temporal Difference Learning Matthieu Geist Sup´elec, IMS Research Group, Metz, France [removed]

Add to Reading List

Source URL: icml.cc

Language: English - Date: 2012-06-07 13:20:38
80Reinforcement learning / Bayesian statistics / Machine learning / Temporal difference learning / Bayesian inference / Reinforcement / Statistics / Computational neuroscience / Cybernetics

Reinforcement Learning yields an exact update shown to work well on reinforcement learning benchmarks. REPS can be generalized hierarchically [6] using a gating network to choose among several option policies. This hiera

Add to Reading List

Source URL: www.is.tuebingen.mpg.de

Language: English - Date: 2012-12-21 09:36:21
UPDATE